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Abstract 

In this paper we give lower bounds for the representation of real 
univariate polynomials as sums of powers of degree 1 polynomials. We 
present two families of polynomials of degree d such that the num¬ 
ber of powers that are required in such a representation must be at 
least of order d. This is clearly optimal up to a constant factor. Pre¬ 
vious lower bounds for this problem were only of order n(-\/d), and 
were obtained from arguments based on Wronskian determinants and 
"shifted derivatives." We obtain this improvement thanks to a new 
lower bound method based on Birkhoff interpolation (also known as 
"lacunary polynomial interpolation"). 


1 Introduction 

In this paper we obtain lower bounds for the representation of a univariate 
polynomial / E IR[J7] of degree d under the form: 

l 

/b) = + (1) 

i=l 

where the 7/j are real constants and the exponents Cj nonnegative integers. 

We give two families of polynomials such that the number I of terms 
required in such a representation must be at least of order d. This is clearly 
optimal up to a constant factor. Previous lower bounds for this problem m 
were only of order n{^/d). The polynomials in our first family are of the 
form Hi{x) = Yli=i + with all at nonzero and the Xj’s distinct. We 
show that that they require at least I > k terms whenever k < {d + 2) /4. 
In particular, for k = (d -|- 2)/4 we obtain I = k = (d -|- 2)/4 as a lower 
bound. The polynomials in our second family are of the form H 2 {x) = 
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{x + 1)'^"’“^ — and we show that they require more than (d — l)/2 terms. 
This improves the lower bound for Hi by a factor of 2, but this second 
lower bound applies only when the exponents e* are required to be bounded 
by d (obviously, if larger exponents are allowed we only need two terms to 
represent H 2 ). It is easily shown that every polynomial of degree d can be 
represented with [(d + l)/2] terms. This implies that of all polynomials of 
degree d, H 2 is essentially (up to a small additive constant) the hardest one. 

Our lower bound results are specific to polynomials with real coefficients. 
It would be interesting to obtain similar lower bounds for other fields, e.g., 
hnite fields or the field of complex numbers. As an intermediate step toward 
our lower bound theorems, we obtain a result on the linear independence of 
polynomials which may be of independent interest. 

Theorem 1. Let fi,...,fk G 1R[-A] be k distinct polynomials of the form 
fi{x) = {x + OiY'. Let us denote by nj the number of polynomials of degree 
less than j in this family. 

If ni < 1 and Uj + Uj-i < j for all j, the family (/j) is linearly indepen¬ 
dent. 

We will see later (in Section U Remark 1 1 7p that this theorem is optimal 
up to a small additive constant when d is even, and exactly optimal when d 
is odd. 

Motivation and connection to previous work 

Lower bounds for the representation of univariate polynomials as sums of 
powers of low degree polynomials were recently obtained in HU. We continue 
this line of work by focusing on powers of degree one polynomials. This 
problem is still challenging because the exponents e* may be different from 
d = deg(/), and may be possibly larger than d. The lower bounds obtained 
in m are of order Ll{y/d). We obtain Ll{d) lower bounds with a new method 
based on polynomial interpolation (more on this below). 

The work in m and in the present paper is motivated by recent progress 
in arithmetic circuit complexity. It was shown that strong enough lower 
bounds for circuits of depth four HI El [19] or even depth three mm 
would yield a separation of Valiant’s [20| algebraic complexity classes VP 
and VNP. Moreover, lower bounds for such circuits were obtained thanks to 
the introduction by Neeraj Kayal of the method of shifted partial derivatives, 
see e.g. HHl B 0 HU El El Ej. Some of these lower bounds seem to come 
close to separating VP from VNP, but there is evidence that the method 
of shifted derivatives by itself will not be sufficient to achieve this goal. 
It is therefore desirable to develop new lower bounds methods. We view 
the models studied in im and in the present paper as "test beds" for the 
development of such methods in a fairly simple setting. We note also that (as 
explained above) strong lower bounds in slightly more general models would 
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imply a separation of VP from VNP. Indeed, if the affine function x + yi 
in ([T| are replaced by a multivariate affine functions we obtain the model 
of "depth 3 powering arithmetic circuits.” In general depth 3 arithmetic 
circuits, instead of powers of affine functions we have products of (possibly 
distinct) affine functions. We note that the depth reduction result of |9] 
yields circuits where the number of factors in such products can be much 
larger than the degree of the polynomial represented by the circuit. It is 
therefore quite natural to allow exponents e* > d in ([1]). Likewise, the model 
studied in m is close to depth 4 arithmetic circuits, see im for details. 

Birkhoff interpolation 

As mentioned above, our results are based on polynomial interpolation and 
more precisely on Birkhoff interpolation (also known as "lacunary interpola¬ 
tion"). The most basic form of polynomial interpolation is Lagrange inter¬ 
polation. In a typical Lagrange interpolation problem, one may have to find 
a polynomial g of degree at most 2 satisfying the 3 constraints g{ — l) = 1, 
g{0) = 4, gf(l) =3. At a slightly higher level of generality we find Hermite 
interpolation, where at each point we must interpolate not only values of 
g but also the values of its first few derivatives. As an example, we may 
have to hnd a polynomial g of degree 3 satisfying the 4 constraints g{0) = 1, 
g{l) = 0,g^(l) = —l,g''{l) = 2. Birkhoff interpolation is even more general 
as there may be “holes” in the sequence of derivatives to be interpolated 
at each point. An example of such a problem is: ^(0) = 0, g'{l) = 0, 
g{2) = 5 ” (2) = 0. We have set the right hand side of all constraints to 0 
because the interpolation problems that we need to handle in this paper all 
turn out to be of that form (in general, one may naturally allow nonzero 
values). Our interest is in the existence of a nonzero polynomial of degree at 
most d satisfying the constraints, and more generally in the dimension of the 
solution space. In fact, we need to know whether it has the dimension that 
one would expect by naively couting the number of constraints. Contrary 
to Lagrange or Hermite interpolation in one variable, where the existence of 
a nonzero solution can be easily decided by comparing the number of con¬ 
straints to d -|-1 (the number of coefficients of g) , this is a nontrivial problem 
and a rich theory was developed to address it |18| . Results of the real (as 
opposed to complex) theory of Birkhoff interpolation turn out to be very 
well suited to our lower bound problems. This is the reason why we work 
with real polynomials in this paper. 

The Waring problem 

Any homogenous (multivariate) polynomial / can be written as a sum of 
powers of linear forms. In the Waring problem for polynomials one attempts 
to determine the smallest possible number of powers in such a representa- 
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tion. This number is called the Waring rank of /. Obtaining lower bounds 
from results in polynomial interpolation seems to be a new method in com¬ 
plexity theory, but it may not come as a surprise to experts on the Waring 
problem. Indeed, a major result in this area, the Alexander-Hirschowitz 
theorem m , see |3] for a survey), is usually stated as a result on (multi¬ 
variate, Hermite) polynomial interpolation. Classical work on the Waring 
problem was focused on the Waring rank of generic polynomials, and this 
question was completely answered by Alexander and Hirschowitz. The focus 
on generic polynomials is in sharp contrast with complexity theory, where a 
main goal is to prove lower bounds on the complexity of explicit polynomi¬ 
als (or of explicit Boolean functions in Boolean complexity). A few recent 
papers mm have begun to investigate the Waring rank of specific (or ex¬ 
plicit, in computer science parlance) polynomials such as monomials, sums 
of coprime monomials, the permanent and the determinant. We expect that 
more connections between lower bounds in algebraic complexity, polynomial 
interpolation and the Waring problem will be uncovered in the future. 

Organization of the paper 

In Section [2] we begin a study of the linear independence of polynomials of the 
form (x + yiY^. We show that this problem can be translated into a problem 
of Birkhoff interpolation, and in fact we show that Birkhoff interpolation and 
linear independence are dual problems. In Section [3] we present the notions 
and results on Birkhoff interpolation that are needed for this paper, and we 
use them to prove Theorem [TJ We build on this result to prove our lower 
bound results in Sectional and we discuss their optimality. The lower bound 
problem studied in this paper is over the field of real numbers. In Section [5] 
we briefly discuss the situation in other fields and in particular the field of 
complex numbers. Finally, we give an illustration of our methods in the 
appendix by completely working out a small example. 

2 Prom linear independence to polynomial interpo¬ 
lation 

There is a clear connection between lower bounds for representations of poly¬ 
nomials under form © and linear independence. Indeed, proving a lower 
bound for a polynomial / amounts to showing that / is linearly independent 
from (x -|- ..., (x -|- yiY^ for some I and for any sequence of I pairs 

(yi,ei),..., {yi,ei). Moreover, if the "hard polynomial" / is itself presented 
as a sum of powers of degree 1 polynomials (which is the case in this paper), 
we can obtain a lower bound for / from linear independence results for such 
powers. This motivates the following study. 

Let us denote by [R(i[Ai] the linear subspace of IR[-A] made of polynomials 
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of degree at most d, and by th A:-th order derivative of a polynomial g. 

Proposition 2. Let fi, ■ ■ ■, fk £ be k distinct polynomials of the form 

fi{x) = (x + OiY^. The family {fi)i<i<k is linearly independent if and only 

if 

dim{g' G = 0 for all i} = d + 1 — k. 

Let V be the subspace of lRd[-^] spanned by the fi. The orthogonal V~^ 
of V is the space of linear forms (f G IR£;[X]* such that {(j), f) = 0 for all 
f £ V. We will use the fact that diml/-*- = d + 1 — dimP. We will identify 
Rrf[X] with its dual [Rrf[X]* via the symmetric bilinear form 

d p 

/ i*\ Jk9d-k 

k=0 \k) 

This is reminiscent of Weyl’s unitarily invariant inner product (see e.g. chap¬ 
ter 16 of [5] for a recent exposition) but we provide here a self-contained 
treatment. Poposition [2] follows immediately from the next lemma: 

Lemma 3. The orthogonal f^ of fi is equal to {g G IRrf[X]; g^'^~^'\ai) = 0}. 

Proof. We begin with the case Cj = d. We need to show that for a polynomial 
f{x) = {x + aY, {g, /) = 0 iff g{a) = 0. This follows from the definition of 
{g, f) since by expanding (x -|- aY in powers of x we have 

d 

{g, (x + aY) = ^ gd-ka’^~^ = 9{a)- (2) 

k=0 

Consider now the general case /(x) = (x -|- aY~^ where /c > 0. We will 
show that 

»'*’(“)= ( 3 ) 

thereby completing the proof of the lemma. In order to obtain ([3]) from ([2]) 
we introduce a new variable e and expand in two different ways {g, {x+a+eY) 
in powers of e. From ([2]) we have 

{g, (x -I- o -I- eY) = 5(« + e) = ^ ^ ■ (4) 

fc=o 

On the other hand, since (x -|- a -|- eY = Ylk=o 4" aY~^ we have from 

bilinearity 

(9. (x + a + if) = Y.(t) <9. (5) 

k=0 ^ ^ 

Comparing (jH and ([5]) shows that = (^)(g, (x -|- aY~Y- 
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Since IRrf[X] has dimension d + 1 we must have A: < d + 1 for the fi 
to be linearly independent. More generally, let nj be the number of fi’s 
which are of degree less than j. Again, for the fi to be linearly independent 
we must have rij < j for all j = l,...,d+ 1. The polynomial identity 
(x + 1)^ —(x —1)^—4x = 0 shows that the converse is not true, but Theorem[T] 
from the introduction shows that a weak converse holds true. We will use 
Proposition [2] to prove this theorem at the end of the next section. 

3 Interpolation matrices 

In Birkhoff interpolation we look for a polynomial g E IRd[Ai] satisfying a 
system of linear equations of the form 

= Ci^k- ( 6 ) 

The system may be lacunary, i.e., we may not have an equation in the system 
for every value of i and k. We set = 1 if such an equation appears, and 
Gi,k = 0 otherwise. We arrange this combinatorial data in an interpolation 
matrix E = {ei^k)i<i<m,o<k<n- We assume that the knots xi,... ,Xm are 
distinct. It is usually assumed [18] that \E\ = the number of I’s 

in E, is equal to d + 1 (the number of coefficients of g). Here we will only 
assume that \E\ < d + 1. We can also assume that n < d since g^^'^ = 0 for 
k > d. In the sequel we will in fact assume that n = d: this condition can 
be enforced by adding empty columns to E if necessary. 

Let X = {xi,..., Xm} be the set of knots. When \E\ = d + 1, the pair 
{E,X) is said to be regular if Q has a unique solution for any choice of 
the Ci^k- Finding necessary or sufficient conditions for regularity has been a 
major topic in Birkhoff interpolation |18| . For \E\ < d+1, we may expect Q 
to have a set of solutions of dimension d + 1 — \E\. We therefore extend the 
definition of regularity to this case as follows. 

Definition 4. The pair {E, X) is regular if for any choice of the Ci^k the set 
of solutions of is an affine subspace of dimension d + 1 — 

Note in particular that the set of solutions is nonempty since \E\ < d+1. 
Basic linear algebra provides a link between regularity for different values 
of \E\. 

Proposition 5. Let E be an m x (d 1) interpolation matrix. For an 
interpolation matrix E of the same format, we write E C E if Ci^k = 0 
implies fi^k = 0 (i.e., the set of I’s of F is included in the set of I’s of E). 
If the pair (E, X) is regular and FEE then (F, X) is regular as well. 

Proof. Consider the interpolation problem: 

gW(xi) = a^k for fi^k = 1- 
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The set of solutions T C [Rd[-^] is an affine subspace which is either empty 
or of dimension at least d + 1 — \F\. It cannot be empty since by adding 
|ili| — |T| constraints we can obtain an interpolation problem with a solution 
space of dimension d + 1 — \E\ >0. For the same reason, it cannot be of 
dimension d + 2 — \F\ or more. In this case, by adding \E\ — |T| constraints 
we would obtain an interpolation problem with a solution space of dimension 
at least (d + 2 — |F|) — (|i?| — \E\) = d + 2 — \E\. This is impossible since 
{E,X) is regular. □ 

Another somewhat more succint way of phrasing the above proof is to 
consider the matrix of the linear system defining the affine subset F. Antic¬ 
ipating on Sectional let us denote this matrix by A(E,X). The pair {E,X) 
is regular iff A{E, X) if of full row rank. The rows of A(E, X) are also rows 
of A{E,X), so A{F,X) must be of full row rank if A{E,X) is. 

For an interpolation matrix, the notions of regularity and order regularity 
are classicaly defined m in the case \E\ = d + 1, but the extension to the 
general case |Fi| < d + 1 is straightforward: 

Definition 6. The interpolation matrix E is regular if (E, X) is regular for 
any choice of m knots xi,, Xm- It is order regular if {E, X) is regular for 
any choice of m ordered knots xi < X 2 ... < Xm- 

As an immediate corollary of Proposition [S] we have: 

Corollary 7. Let E,E be two interpolation matriees with E Q E. If E is 
regular (respectively, order regular) then E is also regular (respectively, order 
regular). 

We will give in Theorem [ 8 ] a sufficient condition for order regularity, but 
we first need some additional definitions. Say that an interpolation matrix 
E satisfies the upper Polya condition if for r = 1,... ,d + 1 there are at 
most r I’s in the last r columns of E. If |Fi| = d -|- 1 this is equivalent to 
the Polya condition: there are at least r I’s in the first r columns of E for 
r = 1 ,..., d + 1 . 

Consider a row of an interpolation matrix E. By sequence we mean a 
maximal sequence of consecutive I’s in this row. A sequence containing an 
odd number of I’s is naturally called an odd sequence. A sequence of the zth 
row is supported if there are I’s in E both to the northwest and southwest 
of the first element of the row. More precisely, if (i, k) is the position of the 
first 1 of the sequence, E should contain I’s in positions (ii, ki) and ( 12 , ^ 2 ) 
where ii < i < i 2 , k\ < k and k 2 < k. The following important result 
(Theorem 1.5 in |18) 1 is due to Atkinson and Sharma [3]. 

Theorem 8. Let E be an mx (d+l) interpolation matrix with \E\ = d + 1. 
If E satisfies the Polya condition and contains no odd supported sequence 
then E is order regular. 
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As an example, the interpolation problem corresponding to the polyno¬ 
mial identity (x -|- 1)^ — (x — 1)^ — 4x = 0 is: 

5(-l) =0, 5'(0) =0, 5(1) =0 

where g E IR 2 [-^]- It admits g{x) = x^ — 1 as a nontrivial solution. The 
corresponding interpolation matrix 



satisfies the Polya condition but contains an odd supported sequence in its 
second row. 

Corollary 9. Let E he anmx{d+l) interpolation matrix with \E\ = d+1. 
If E satisfies the Polya condition, then: 

(i) if every odd sequence of E belongs to the first row, to the last row, or 
begins in the first column then E is order regular. 

(a) if every odd sequence of E begins in the first column then E is regular. 

Proof. Part (i) follows from the fact that a sequence which belongs to the 
first row, to the last row, or begins in the first column cannot be supported. 

For part (ii), assume that every odd sequence of E begins in the first 
column and fix m distinct nodes xi,..., Xm- By reordering the Xj’s we obtain 
an increasing sequence x'^ < X 2 < • • • < x(„. Applying the same permutation 
on the rows of E, we obtain an interpolation matrix E'; clearly, the pair 
{E, X) is regular if and only if {E', X') is. The latter pair is regular because 
X' is ordered and (by part (i)) E' is order regular. □ 

We can now prove the main result of this section. 

Theorem 10. Let F be anmx (d-|-l) interpolation matrix. We denote by Nr 
the number of 1 ’s in the last r columns of E. // < 1 and Nr + A^r-i < f 

for r = 2,... ,d + 1 then E is regular. 

Note that the conditions on Nr are a strengthening of the upper Polya 
condition Nr < r. 

Proof. We will add I’s to E so as to obtain a matrix E satisfying the hy¬ 
pothesis of Corollary [9l part (ii). Corollary [7] will then imply that E is 
regular. 

In order to construct E we proceed as follows. First, for every odd 
sequence of E which does not begin in the first column we add a 1 in the 
cell immediately to the left of its first 1. All odd sequences of the resulting 
matrix E' begin in the first column. Moreover, we have added at most W-i 


I’s in the last r columns of F (note that we can add exactly Nj—i I’s when 
the last r — 1 columns contain N^-i sequences of length 1). Since A^i < 1 
and Nr + Nr-i < r, F' satisfies the upper Polya condition. If |P'| = (i +1 we 
set E = F'. This matrix satisfies the Polya condition and its odd sequences 
all begin in the first column, so we can indeed apply Corollary [9] to get that 
E is regular and. Since E F E, hy Corollary O we conclude that E is also 
regular. 

If |T'| < d +1 we need to add more I’s. It suffices to add d+l — \E'\ new 
rows to E' with a 1 in the first column and O’s everywhere else. Denoting by 
E the resulting matrix we clearly have that E satisfies the Polya condition, 
\E\ = d + 1 and its odd sequences begin in the first column, so Corollary [9] 
and Corollary [7] apply here to conclude that E is regular. Note that E and 
F do not have the same format since E has more rows, but we can apply 
Corollary [7] if we first expand E with d + 1 — \F'\ empty rows. □ 

Proof of Theorem [T] 

At this point we have enough knowledge of Birkhoff interpolation to prove 
Theorem [1] In view of Proposition [2] we need to show that the interpolation 
problem 

g,(<i-ei)(Q.) = 0 for f = 1,..., A; 

has a solution space of dimension d + 1 — k. Let E be the corresponding 
interpolation matrix. This matrix contains d+l — k I’s and is of size mx {d+ 
1) for some m < k (we have m = k only when the Oj’s are all distinct). The 
hypothesis on the rij ’s implies that F satisfies the hypothesis of Theorem [TOl 
and the result follows from the regularity of F. 


4 Lower bounds 


System ([^ is a linear system of equations in the coefficients of g. Fol¬ 
lowing |18| . to set up this system it is convenient to work in the basis 
{x^/j^-)o<j<d instead of the standard basis {x^)o<j<d- We denote by A{E, X) 
the matrix of the system in that basis, where as in the previous section E 
denotes the corresponding interpolation matrix and X the set of knots. As 
already pointed out after Proposition [5l the pair (E, X) is regular if and only 
if A[E,X) is of rank |F1|. In our chosen basis, an interpolation constraint of 
the form ([6]) reads: 


E 


j=o 



(j - k)\ 


9j — Ci,k 


where the coefficients go, ■■■ ,gd are the unknowns and we choose as in |18) 
to interpret 1/r! as 0 for r < 0. 
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Proposition 11. Consider a pair {E, X) where E is an interpolation matrix 
of size m X (d + 1) and X a set of m knots. Let Ei be the matrix formed 
of the first r + 1 eolumns of E and E 2 the matrix formed of the remaining 
d — r eolumns. 

Suppose that Ei eontains at most r + 1 I’s and E 2 at most d — r I’s. If 
both pairs (Ei,X), (E 2 ,X) are regular then {E,X) is regular. 

Proof. The case where \Ei\ = r + 1 and \E 2 \ = d — r is treated in Theorem 1.4 
of |18) . Their argument extends to the general case. Indeed, as shown 
in |18) the rank of A{E, X) is at least equal to the sum of the ranks of 
A{Ei,X) and A{E 2 ,X). For the reader’s convenience, we recall from |18| 
that this inequality is due to the fact that A{E,X) can be transformed by a 
permutation of rows into a matrix of the foriq^ 

A{E^,X) 

0 A{E2,X) 

The two matrices A{Ei,X), A{E 2 ,X) are respectively of rank \Ei\ and \E 2 \ 
since the corresponding pairs are assumed to be regular. Thus, A{E, X) is of 
rank at least \E\ = |ilii| + |i? 2 |- This matrix must in fact be of rank exactly 
\E\ since it has \E\ rows, and we conclude that the pair (E, X) is regular. □ 

Lemma 12. For any finite sequence (ui)o<i<n of real numbers with n > 1 
there is an index s G {0,... ,n — 1} such that (ug+t — Us)/t < {un — Uo)/n 
for every t = 1,... ,n — s. 

Proof. Let a = mino<j<n-i(lin — Ui)/(n — i) and let s be an index where the 
minimum is achieved. We have (un — Us)/{n — s) = a < {un — Uo)/n. 

For every t = l,...,n —swe also have {un — Us)/{n — s) < {un—Us+t)/{n — 
s — t), which implies {ug+t — Ug)/t < [un — Us)/{n — s) < {un — uo)/n. □ 

Our lower bound results are easily derived from the following theorem. 

Theorem 13. Consider a polynomial identity of the form: 

k I 

^aiix + Xi^ = '^Pi{x + yiy^ (7) 

i=l i=l 

where the Xi are distinct real constants, the constants Oj are not all zero, the 
fli and yi are arbitrary real constants, and Ci < d for every i. Then we must 
have k -\-l > {d-\- 2 )/ 2 . 

Proof. We assume without loss of generality that the I polynomials {x + yiY^ 
are linearly independent. Indeed, the right-hand side of ([7]) could otherwise 
be rewritten as a linear combination of an independent subfamily, and this 

^We give an example in the appendix. 
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would only decrease 1. Let us also assume that k + I < {d + 2)/2. Then we 
will show that the k + I polynomials {x + Xi)'^, {x + yiY^ must be linearly 
independent. This is clearly in contradiction with ([7|. 

In view of Proposition [2l to show that our k +1 polynomials are linearly 
independent we need to show that the corresponding interpolation problem 
has a solution space of dimension d +1 — k — 1. Let E be the corresponding 
interpolation matrix and X the set of knots: |X| = m where m is the 
number of distinct points in xi,... ,Xk,yi, ■ ■ ■ ,yi', moreover, E is a matrix 
of size m X (d + 1) which contains k + I I’s. We need to show that the pair 
{E,X) is regular. 

Let Nt be the number of I’s in the last t columns of E. We must have 
Ni < 1, or else the independent family (x + yiY^ would contain more than 
one constant polynomial. We can now complete the proof of the theorem 
in the special case where E satisfies the conditions Nt + Nt-i < t for every 
t = 2,... ,d+ 1. Indeed, in this case E is regular by Theorem 1101 fremember 
that this is how we proved Theorem [T| our main linear independence result). 
For the general case, the idea of the proof is to: 

(i) Split vertically E in two matrices Ei,E 2 . 

(ii) Apply the same argument (Theorem fTOll to Ei. 

(hi) Obtain the regularity of the pair {E2,X) from the linear independence 
of the {x + ytY"- 

(iv) Conclude from Proposition fTTl that the pair (E,X) is regular. 

We now explain how to carry out these four steps. For the first one, note 
that Nd+i = \E\ = k + l < (d + 2)/2. Let us apply Lemma fT^ to the sequence 
id^i)o<i<d+i beginning with Nq = 0. The lemma shows the existence of an 
index s G { 0 ,... , d} such that 

Ns+t ~ ^ d^d+i ^ d + 2 _1 1 ^11 

t “ d+ 1 “ 2 (d + 1 ) ~ 2^ 2 (d + 1 ) -2^Yt 

for every t = l,...,d+l — s. Let Ei be the matrix formed of the first r + 1 
columns of E, where r = d — s. The number of I’s in the last t columns 
of El is Nj- = W+t — Ng < {t + l)/2. In particular, A^{ < 1 and, since 
Nj-, Nl_i are integers, we get that Nj- + N^i < + l)/2j = t for fol t £ 

{2,...,r + l}. This matrix therefore satisfies the hypotheses of Theorem 1101 
and we conclude that Ei is regular. This completes step (ii). For step (iii), we 
note that since E 2 has s < d +1 columns the Birkhoff interpolation problem 
corresponding to the polynomials (x + ytY" with e* < s — 1 admits {E 2 ,X) 
as its pair. Since these polynomials are assumed to be linearly independent, 
E2 must contain at most s I’s and (E2,X) must indeed be a regular pair. 
Finally, we conclude from Proposition 1111 that (E,X) is regular as well. □ 


11 






Theorem 14 (First lower bound). Consider a polynomial of the form 

k 

Hi{x) = '^ai{x + Xi^ ( 8 ) 

i=l 

where the Xi are distinct real constants, the ai are nonzero real constants, 
and A: < (d + 2)/4. If Hi is written under the form 

l 

Hi{x) = '^l3i{x + yiy^ (9) 

i=l 

with Ci < d for every i then we must have I > k. 

Proof. Assume first that Ci < d for all i. By Theorem [13] we must have 
k + I > {d + 2)/2, so I > {d + 2)/2 — k > k. Consider now the general case 
and assume that I < k. We reduce to the previous case by moving on the 
side of ([8|) the k' polynomials in ([9|) of degree Cj = d. On the second side 
remains a sum of l — k' terms of degree less than d. We have on the first side 
a sum of terms of degree d. Taking possible cancellations into account, the 
number of such terms is at least k — k' > 0, and at most k + k'. We must 
therefore have {k + k') + {I — k') > (d + 2 )/ 2 , so I > {d + 2)/2 — k > k after 
all. □ 

In other words, writing Hi under form Q is exactly optimal when k < 
[d + 2)/4. We can give another lower bound of order d (with an improved 
constant) for a polynomial of a different form. 

Theorem 15 (Second lower bound). Let H 2 € be the polynomial 

H 2 {x) = {x + l)'^'*'^ — If H 2 is written under the form 

l 

H2{x) = ^Pi{x + yif^ 
i=l 

with Ci < d for every i then we must have I > (d — l)/ 2 . 

Proof. This follows directly from Theorem [13] after replacing d by d+l in ([7]). 
Since k = 2 we must have 2 + 1 > (d + 3)/2, i.e., I > {d— l)/ 2 . □ 

This result shows that allowing exponents Ci > d can drastically decrease 
the “complexity” of a polynomial since H 2 can be expressed as the difference 
of only two (d+l)-st powers. Such savings cannot be obtained for all polyno¬ 
mials. Indeed, the next result, which subsumes Theorem M shows that no 
improvement is possible for Hi even if arbitrarily large powers are allowed. 
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Theorem 16 (Third lower bound). Consider a polynomial of the form 

k 

Hi{x) = '^ai{x + Xi^ (10) 

i=l 

where the Xi are distinct real constants, the ai are nonzero real constants, 
and A: < (d + 2)/4. If Hi is written under the form 

l 

Hi{x) = '^l3i{x + yiy^ ( 11 ) 

i=l 


then we must have I > k. 

Note that the exponents Cj may be arbitrarily large. 

Proof. Let n be the largest exponent e, which occurs with a coefficient /?, 7 ^ 0. 
The case n < d is covered by Theorem [T31 so we assume here that n > d. In 
equation (ttH), let us move all the n-th powers from the right hand side to 
the left hand side, and the k degree-d terms of Hi from the left hand side 
to the right hand side. Applying Theorem [T2] to this identity shows that 
k + l > (n + 2)/2. Hence I > {n + 2)/2 — k > k. □ 

Remark 17. The lower bound for H 2 in Theorem\T^is essentially optimal. 
More concretely, it is optimal up to one unit when d is even, and exactly 
optimal when d is odd. 

Note indeed that by a change of variable, representing H 2 is equivalent to 
representing the polynomial H^{x) = (x + 1)'^^^ —(x—1)'^'*'^. If we expand the 
two binomials in H^ the monomials of degree d +1 — j wih even j cancel, and 
we obtain a sum of \^^~\ monomials. See ProvositionllM for a generalization 
of this observation. In fact, with the same number of terms we can represent 
not only H 2 but all polynomials of degree d: see Provosition 1 1 dl below. 

The consideration of H 3 also shows that Theorem\J\is optimal up to one 
unit when d is even, and exactly optimal when d is odd. Indeed, we have just 
observed that there is a linear dependence between the 2 + 1 " polynomials 
{x + 1)'^+^, (x — 1)'^+^,x^,x‘^“^,x'^“^,_ 

Ifd is odd, the number of polynomials of degree less than j in this sequence 
is Uj = [j/2\ for j < d + 1; moreover, nd +2 = 2 + (d + l)/2 = (d + 5)/2. 
Hence nj + n^+i = j for j < d; moreover, Ud+i + nd +2 = d + 3. 

If d is even, the number of polynomials of degree less than j in this 
sequence is Uj = \j/ 2 '\ for j < d + 1; moreover, nd +2 = 2 + (d + 2)/2 = 
(d+6)/2. Hence nj +nj+i = j + 1 for j < d; moreover, Ud+i +nd +2 = d+4. 

A simple construction shows that all polynomials of degree d can be 
written as a linear combination of [(d + l)/ 2 ] powers. 
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Proposition 18. Every polynomial of degree d can be expressed as 

Yfi=i + ViY" with I < |■(^^ + l)/ 2 ]. 

Proof. We use induction on d. Since the result is obvious for d = 0,1 we 
consider a polynomial / = 'f 2 i=o degree d > 2, and we assume that 

that the Proposition holds for polynomials of degree d — 2. We observe that 
g ■= f — ad{x + {ad-i/dad)Y has degree < d — 2. Applying the induction 
hypothesis to g we get that g = Yli=i + ViY^■: with V < [(d — l)/2]. 
Hence, setting I = V + 1 , fdi = a^, yi = ad-i/{dad) and e; = d, we conclude 
that / = Yfi=i Pi{x + and / < 1 + |"(d - l)/2] = |"(d+ l)/2]. □ 

Theorem [El therefore shows that of all polynomials of degree d, H2 is 
essentially (up to a small additive constant) the hardest one. 


5 Changing Fields 


Some of the proof techniques used in this paper, and even the results them¬ 
selves, are specific to the field of real numbers. This is due to the fact that 
certain linear dependence relations which cannot occur over R may occur if 
we change the base field. For instance, over a field of characteristic p > 0 
we have {X -|- 1)^ — —1 = 0 for any k (compare with Theorem [T] for 

the real case). The remainder of this section is devoted to a discussion of 
the complex case. We begin with an identity which generalizes the identity 
(x -|- 1)^ — {x — 1)^ — 4x = 0. 

Proposition 19. Take k E Z+ and let be a k-th primitive root of unity. 
Then, for all d E TX and all p, £ C the following equality holds: 

j = \ i= — l (mod k) ^ 

0<i<d 

Proof. We observe that 



To deduce the result it suffices to prove that equals k if i = 

— 1 (mod k), or 0 otherwise. Whenever i = —1 (mod k) we have that = 
(^*+i)i = I for all j E {1,... , /c}. For i ^ —1 (mod k), the summation of 
the geometric series shows that 






^*+1 - 1 


= 0 . 


□ 
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For any d,k & Z^, Proposition [19] yields an identity of the form 
k I 

+ ( 12 ) 

i=i i=i 

where the xj are distinct complex constants, the Cij,f3j are nonzero complex 
numbers, I = |_^^J and ej < d for all j. Note the sharp contrast with 
theorems M and M In particular, Theorem 1141 gives an Q(d) lower bound 
for polynomials of the form + Xi)'^ over the field of real numbers 

(the implied constant in the 12 notation is equal to 1/4). But in (11211 we 
have k.l < d + 1, so k < y/d+1 or I < y/d+1. We conclude that no 
better lower bound than fl(\/d) can possibly hold over C for the same family 
of polynomials, at least for arbitrary distinct Xj’s and arbitrary nonzero Oj. 
Such a Q{y/d) lower bound was recently obtained for the more general model 
of sums of power of bounded degree polynomials: see Theorem 2 in m 
We leave it as an open problem to close this quadratic gap between lower 
bounds over R and C: find an explicit polynomial / E C[X] of degree d which 
requires at least k = kl{d) terms to be represented under the form 

k 

f{x) = y^^ai{x + Xiy\ 

i=l 

With the additional requirements ei < d for all i, the “target polynomial” 
H 2 {x) = (x +1)'^'*'^ — from Theorem [T5] looks like a plausible candidate. 
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Appendix: a worked out example 

We illustrate the proof method of Theorem [121 (more than the result itself) on a 
small example: we show with this method that there is no identity of the form 

ctix^ + 0 : 2 ( 2 : + 1)^ + 03 (x + 3)^ = I3\x^ + P2(,x + 1) + + 3)^ (13) 


except if the coefficients ai. Pi G R are all 0. The corresponding interpolation 
problem is: 

g(0) = 0, 5(1) = 0, 5(3) = 0, 5^"^(0) = 0, 5^"^(1) = 0, 5^'^(3) = 0 (14) 

where 5 G R 5 A]. The set of knots is X = {a:i,a: 2 ,X 3 } = {0,1,3} and the interpo¬ 
lation matrix is 

/ 1 0 0 1 0 0 \ 

£ 1 = 1 0 0 0 1 0 . 

\ 1 0 0 1 0 0 / 

This matrix is not order regular. Indeed, the pair {E,Y) where Y = { — 1,0,1} is 
not regular. This follows from the identity 

{x 1)^ — (x — 1)^ — 4a: = 0 


which was pointed out earlier in the paper. We will nonetheless show that the 
pair {E,X) is regular. Toward this, let us split E in the middle to obtain the two 
matrices 


El 


1 0 0 \ 
10 0 
10 0 / 


and 


E 2 


1 0 0 \ 
0 10 
100 / 


The first matrix is regular since all its I’s are in the first column. The second matrix 
fails to be order regular for the same reason that E does, but it is easy to check 
that the interpolation problem h{0) = 0, h'{l) = 0, h{3) = 0 has no nontrivial 
solution in R 2 [X]. Hence the pair {E 2 ,X) is regular. It follows from Proposition [TT] 
that {E,X) is a regular pair, and the 6 polynomials in (1131) are indeed linearly 
independent. 

We conclude with a remark about Proposition [TT] In the proof of this result, 
we used the fact that the matrix A(E,X) of the linear system can be transformed 
by a permutation of rows into a matrix of the form 


AiEi,X) 

0 


AiE2,X) ) ■ 
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We point out that when the 6 interpolation constraints are listed in the same order 
as in (flS . A{E, X) is already in this form. In particular, the equations for the last 
3 constraints are: 

53 + a;i54 + 3:155/2 = 0, 

54 + 3^255 = 0, 

53 + 3:354 + 3:355/2 = 0 

and the matrix of this subsystem is indeed A{E 2 ,X). As to A{Ei,X), con¬ 
sider for instance the third constraint 5 ( 3 : 3 ) = 0. The corresponding equation 
is 'Yhi=o ^^9 3 /= 0- The first 3 coefficients are l,a: 3 ,a :|/2 and this is the last row 
of A(Ai,A:). 
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